Approximating likelihoods for large spatial data sets
نویسندگان
چکیده
Likelihood methods are often difficult to use with large, irregularly sited spatial data sets, owing to the computational burden. Even for Gaussian models, exact calculations of the likelihood for n observations require O.n3/ operations. Since any joint density can be written as a product of conditional densities based on some ordering of the observations, one way to lessen the computations is to condition on only some of the ‘past’ observations when computing the conditional densities. We show how this approach can be adapted to approximate the restricted likelihood and we demonstrate how an estimating equations approach allows us to judge the efficacy of the resulting approximation. Previous work has suggested conditioning on those past observations that are closest to the observation whose conditional density we are approximating. Through theoretical, numerical and practical examples, we show that there can often be considerable benefit in conditioning on some distant observations as well.
منابع مشابه
Spatial Design for Knot Selection in Knot-Based Low-Rank Models
Analysis of large geostatistical data sets, usually, entail the expensive matrix computations. This problem creates challenges in implementing statistical inferences of traditional Bayesian models. In addition,researchers often face with multiple spatial data sets with complex spatial dependence structures that their analysis is difficult. This is a problem for MCMC sampling algorith...
متن کاملApproximation of Multipoint Likelihoods Using Flanking Marker D
The calculation of multipoint likelihoods is computationally challenging with the exact calculation of multipoint probabilities only possible on small pedigrees and many markers or large pedigrees and few markers. This paper explores the utility of approximating multipoint likelihoods using data on markers flanking a hypothesized position of the trait locus. The calculation of such likelihoods ...
متن کاملOn Sparse Variational Methods and the Kullback-Leibler Divergence between Stochastic Processes
The variational framework for learning inducing variables (Titsias, 2009a) has had a large impact on the Gaussian process literature. The framework may be interpreted as minimizing a rigorously defined KullbackLeibler divergence between the approximating and posterior processes. To our knowledge this connection has thus far gone unremarked in the literature. In this paper we give a substantial ...
متن کاملSpatial statistics for lattice points on the sphere I: Individual results
We study the spatial distribution of point sets on the sphere obtained from the representation of a large integer as a sum of three integer squares. We examine several statistics of these point sets, such as the electrostatic potential, Ripley's function, the variance of the number of points in random spherical caps, and the covering radius. Some of the results are conditional on t...
متن کاملMultivariate scan statistics for disease surveillance.
In disease surveillance, there are often many different data sets or data groupings for which we wish to do surveillance. If each data set is analysed separately rather than combined, the statistical power to detect an outbreak that is present in all data sets may suffer due to low numbers in each. On the other hand, if the data sets are added by taking the sum of the counts, then a signal that...
متن کامل